Semantic Information Retrieval from Distributed Heterogeneous Data Sources
نویسندگان
چکیده
Information retrieval from distributed heterogeneous data sources remains a challenging issue. As the number of data sources increases more intelligent retrieval techniques, focusing on information content and semantics, are required. Currently ontologies are being widely used for managing semantic knowledge, especially in the field of bioinformatics. In this paper we describe an ontology assisted system that allows users to query distributed heterogeneous data sources by hiding details like location, information structure, access pattern and semantic structure of the data. Our goal is to provide an integrated view on biomedical information sources for the Health-e-Child project with the aim to overcome the lack of sufficient semantic-based reformulation techniques for querying distributed data sources. In particular, this paper examines the problem of query reformulation across biomedical data sources, based on merged ontologies and the underlying heterogeneous descriptions of the respective data sources.
منابع مشابه
Retrieval of the most relevant combinations of data published in heterogeneous distributed datasets on the Web∗
Finding the most relevant data items among heterogeneous data published on the Web is getting a growing attention in recent years. Retrieving the most relevant data items from a collection of data is a challenge addressed by top-k databases. Accessing heterogeneous and distributed data sources is a challenge addressed by the Semantic Web. How to combine methods and techniques from those two fie...
متن کاملA Semantic and Agent-based Approach to Support Information Retrieval, Interoperability and Multi-lateral Viewpoints for Heterogeneous Environmental Databases
Data stored in individual autonomous databases often needs to be combined and interrelated. For example, in the Inland Water (IW) environment monitoring domain, the spatial and temporal variation of measurements of different water quality indicators stored in different databases are of interest. Data from multiple data sources is more complex to combine when there is a lack of metadata in a com...
متن کاملDSRM: An Ontology Driven Domain Scientific Data Retrieval Model
With the development of information technology, a large number of domain scientific data have been accumulated with the characteristics of distribution and heterogeneity. It has important significance to acquire exact scientific data from multiple data sources for cooperative research. The existing data integration and information retrieval techniques cannot solve the problems of data semantic ...
متن کاملEnriching SQWRL Queries in Support of Geospatial Data Retrieval from Multiple and Complementary Sources
Finding relevant geospatial information is increasingly critical because of the growing volume of geospatial data available through distributed environments. It is also a challenge for the ongoing development of the Geospatial Semantic Web. Data brokers provide searchable repositories through which users can generally retrieve the requested data. But generally, these mechanisms lack the capacit...
متن کاملThe Light-Weight Semantic Web: Integrating Information Extraction and Information Retrieval for Heterogeneous Environments
Today’s Web, large intranets and even the documents collected by a single user are enormous sources of distributed, heterogeneous information that cannot be easily mastered. Syntactical and semantical differences as well as missing semantic annotations make effective query evaluation on such corpora a hard task. The Semantic Web aims at providing a standard for semantic annotations, but has not...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/0707.0745 شماره
صفحات -
تاریخ انتشار 2006